Abstract. In this paper we introduce new models of random graphs, arising from Latin
squares which include random Cayley graphs as a special case. We investigate some 
properties of these graphs including their clique, independence and chromatic numbers, 
their expansion properties as well as their connectivity and Hamiltonicity. The results 
obtained are compared with other models of random graphs and several similarities and 
differences are pointed out. For many properties our results for the general case are 
as strong as the known results for random Cayley graphs and sometimes improve the 
previously best results for the Cayley case. 



1. Introduction 

The concept of random graphs is a very important notion in combinatorics. Although 
there are several models of random graphs, by a random graph one usually refers to the 
model $\mathcal{G}(n,p)$, the probability space of all graphs on [n] in which every edge appears
independently with probability p. For standard results on random graphs we refer the 
reader to the textbooks of Bollobás [7] and Janson, Łuczak and Ruciński [14].

In this paper, we introduce new models of random graphs and study some of their 
properties with particular interest in their relation to the model $\mathcal{G}(n,p)$. Our models
arise from Latin squares. Given a group, one can obtain Latin squares by considering its
multiplication table or its division table. It turns out that the random graph obtained from
the division table of a group G is exactly the random Cayley graph of G (with respect
to a random subset S of G).

Before defining our models, let us recall that a Latin square of order n is an n x n 
matrix L with entries from a set of n elements, such that in each row and in each column, 
every element appears exactly once. Given a Latin square L with entries in a set A of 
size n, and a subset S of A, we define the Latin square graph G(L, S) on vertex set 
[n], by joining i to j if and only if either $L_{ij} \in S$ or $L_{ji} \in S$.
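
As an illustration of this definition, the following Python sketch builds G(L, S) for a small Latin square. The helper names and the choice of the addition table of Z_5 are assumptions made for the example; vertices and symbols are numbered from 0 rather than from 1.

```python
from itertools import combinations

def cyclic_latin_square(n):
    """Addition table of Z_n: an n x n Latin square with entries in {0, ..., n-1}."""
    return [[(i + j) % n for j in range(n)] for i in range(n)]

def is_latin_square(L):
    """Check that every row and every column contains each symbol exactly once."""
    n = len(L)
    symbols = set(range(n))
    rows_ok = all(set(row) == symbols for row in L)
    cols_ok = all({L[i][j] for i in range(n)} == symbols for j in range(n))
    return rows_ok and cols_ok

def latin_square_graph(L, S):
    """Edge set of G(L, S): distinct i, j are joined iff L[i][j] in S or L[j][i] in S."""
    n = len(L)
    return {(i, j) for i, j in combinations(range(n), 2)
            if L[i][j] in S or L[j][i] in S}

L = cyclic_latin_square(5)
edges = latin_square_graph(L, {1})
print(sorted(edges))
```

Loops (the case i = j) are ignored here, matching the convention that the underlying graphs are simple.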

Suppose we are given a sequence $(L_n)$ of Latin squares of order n, with entries in [n],
say. Choosing $S \subseteq [n]$ by picking its elements independently at random with probability
p, we obtain a random Latin square graph $G(L_n, S)$. We denote this model of random
Latin square graphs by $\mathcal{G}(L_n,p)$. A related model is obtained by choosing a multiset S
of k elements of [n] by picking its elements independently and uniformly at random (with
replacement). We denote this model by $\mathcal{G}(L_n,k)$. Note that our underlying graphs are
simple. However, for the model $\mathcal{G}(L_n,k)$ it will be convenient for some of our results
to retain multiple edges and loops. When we do this, we will denote this new model
by $\mathcal{G}_m(L_n,k)$. To be more explicit, in this model the number of edges joining i to j is
exactly the total number of times that $L_{ij}$ and $L_{ji}$ appear in S. In particular, every
$G \in \mathcal{G}_m(L_n,k)$ is a 2k-regular multigraph.
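
The two ways of sampling S, and the 2k-regularity of the multigraph model, can be sketched in Python as follows. The function names, the seed and the use of the division table of Z_11 are assumptions of this example. The matrix follows the rule above literally, so a diagonal entry counts the occurrences of $L_{ii}$ twice.

```python
import random

def division_table(n):
    """Latin square L[x][y] = x - y (mod n), the division table of the cyclic group Z_n."""
    return [[(x - y) % n for y in range(n)] for x in range(n)]

def sample_subset(n, p, rng):
    """S for the subset model: each symbol is kept independently with probability p."""
    return {s for s in range(n) if rng.random() < p}

def sample_multiset(n, k, rng):
    """S for the multiset model: k symbols drawn uniformly with replacement."""
    return [rng.randrange(n) for _ in range(k)]

def multigraph_matrix(L, S):
    """A[i][j] = number of times L[i][j] and L[j][i] occur in the multiset S."""
    n = len(L)
    count = [0] * n
    for s in S:
        count[s] += 1
    return [[count[L[i][j]] + count[L[j][i]] for j in range(n)] for i in range(n)]

rng = random.Random(0)
n, k = 11, 4
S_p = sample_subset(n, 0.5, rng)
A = multigraph_matrix(division_table(n), sample_multiset(n, k, rng))
row_sums = [sum(row) for row in A]
```

Every row sum equals 2k because each element of S occurs exactly once in row i and exactly once in column i of the Latin square.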

A similar model is obtained by looking at the complement of the graph $G \in \mathcal{G}(L_n,p)$.
We denote this model by $\overline{\mathcal{G}}(L_n,p)$. In general, this model is not the same as $\mathcal{G}(L_n, 1-p)$,
the reason being that $L_{ij}$ is not necessarily equal to $L_{ji}$. However, usually it is not too
difficult to translate results from one model into the other, so we will only concentrate
on $G \in \mathcal{G}(L_n,p)$.

Note that, as mentioned above, our models include random Cayley graphs as a special 
case. Indeed, given a group G, consider the Latin square L defined by $L_{xy} = xy^{-1}$. Then,
given any subset S of elements of G, the Latin square graph G(L, S) is exactly the Cayley
graph of G with respect to S. The multiplication table of a group is also a Latin square,
giving rise to what is usually known (motivated by the abelian case) as the Cayley sum
graph. So this model includes random Cayley sum graphs as well.
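
For the cyclic group Z_n, written additively, these two identifications can be checked directly. In the Python sketch below the group, the set S and all function names are assumptions chosen for illustration; the Cayley graph is taken to be undirected, so x and s + x are joined for every s in S.

```python
from itertools import combinations

def latin_square_graph(L, S):
    """Edge set of G(L, S): distinct i, j are joined iff L[i][j] in S or L[j][i] in S."""
    n = len(L)
    return {(i, j) for i, j in combinations(range(n), 2)
            if L[i][j] in S or L[j][i] in S}

def cayley_graph(n, S):
    """Undirected Cayley graph of Z_n: y is joined to s + y for every s in S."""
    edges = set()
    for y in range(n):
        for s in S:
            x = (s + y) % n
            if x != y:
                edges.add((min(x, y), max(x, y)))
    return edges

def cayley_sum_graph(n, S):
    """Cayley sum graph of Z_n: distinct x, y are joined iff x + y lies in S."""
    return {(x, y) for x, y in combinations(range(n), 2) if (x + y) % n in S}

n, S = 12, {1, 5, 6}
division = [[(x - y) % n for y in range(n)] for x in range(n)]
addition = [[(x + y) % n for y in range(n)] for x in range(n)]
same_cayley = latin_square_graph(division, S) == cayley_graph(n, S)
same_sum = latin_square_graph(addition, S) == cayley_sum_graph(n, S)
```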

We should mention here that there are several differences between random Cayley 
graphs and our more general models of random Latin square graphs. For example, random 
Cayley graphs are always vertex transitive. On the other hand a random Latin square 
graph, even if it arises from the multiplication table of a (non-abelian) group, might not 
even be regular. However, it is easy to see that random Latin square graphs are not far 
from being regular in the sense that the ratio of maximum to minimum degree is bounded 
above by 2. 
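
The next Python sketch illustrates this remark on the multiplication table of the symmetric group S_3 (a choice made for this example, as are the function names). Each element of S occurs once in row i and once in column i, so every degree lies between |S| - 1 and 2|S|; the sketch checks these bounds for every non-empty S and records the sets S for which the graph is not regular.

```python
from itertools import combinations, permutations

def compose(g, h):
    """Composition of permutations given as tuples: (g*h)(x) = g[h[x]]."""
    return tuple(g[h[x]] for x in range(len(h)))

def multiplication_table(elements):
    """Latin square L[a][b] = index of elements[a] * elements[b]."""
    index = {g: a for a, g in enumerate(elements)}
    return [[index[compose(g, h)] for h in elements] for g in elements]

def degrees(L, S):
    """Degree sequence of the simple graph G(L, S) (loops are ignored)."""
    n = len(L)
    return [sum(1 for j in range(n)
                if j != i and (L[i][j] in S or L[j][i] in S))
            for i in range(n)]

S3 = list(permutations(range(3)))
L = multiplication_table(S3)
n = len(L)

irregular = []       # sets S for which G(L, S) is not regular
bounds_hold = True   # whether |S| - 1 <= deg <= 2|S| holds throughout
for size in range(1, n + 1):
    for S in combinations(range(n), size):
        deg = degrees(L, set(S))
        if len(set(deg)) > 1:
            irregular.append(S)
        if not all(size - 1 <= d <= 2 * size for d in deg):
            bounds_hold = False
```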

The fact that random Latin square graphs are almost regular (in the above sense)
also motivates the comparison of our models with $\mathcal{G}_{n,r}$, the probability space of all
r-regular graphs on n vertices taken with the uniform measure. (As usual, it is always
assumed that rn is even.)

Sometimes, it is easier to work with random Cayley graphs or random Cayley sum 
graphs for abelian groups, rather than random Latin square graphs. This is because we 
always have $L_{ij} = L_{ji}^{-1}$ in the case of Cayley graphs, and $L_{ij} = L_{ji}$ in the case of Cayley
sum graphs, and so dependences between the edges can be easier to deal with. This 
sometimes leads to sharper results for the first two families of random graphs than for 
general random Latin square graphs; however we have opted to state our results only in 
the general case of random Latin square graphs. 

It seems that the general class of random graphs arising from Latin squares has not
been studied before. However, there has been much interest in random Cayley graphs
and random Cayley sum graphs. For example, Agarwal, Alon, Aronov and Suri [1] 
established an upper bound on the clique number of random Cayley graphs arising from 
cyclic groups and used it to construct visibility graphs of line segments in the plane 
which need many cliques and complete bipartite graphs to represent them. In their study 
of a communication problem, Alon and Orlitsky [5] proved a similar upper bound for 
random Cayley graphs arising from abelian groups of odd order. Green [12], using number 
theoretic tools, studied the clique number of various Cayley sum graphs and showed that 
some of them are good examples of Ramsey graphs while others are not. The diameter of 
random Cayley graphs with logarithmic degree was studied by Alon, Barak and Manber 
in [2]. Alon and Roichman [6] proved that random Cayley graphs (on sufficiently many
generators) are almost surely expanders, a result which was later improved by several
authors [18, 19, 8]. The fact that random Cayley graphs are expanders has several
consequences for the diameter, connectivity and Hamiltonicity of such graphs. Finally, 
some other aspects of the diameter, connectivity and Hamiltonicity of random Cayley 
graphs and random Cayley digraphs were studied in [23, 22, 20, 21]. 

In this paper we extend many of these results to the general case of random Latin square
graphs and show that the structure of the Latin squares has a non-trivial influence on
many properties of random Latin square graphs. In Section 2 we state and discuss our 
main results regarding random Latin square graphs. We prove these results in Section 3, 
Section 4 and Section 5. In Section 6 we give further examples and open problems. 

2. Statements and discussion of the results 

In this section, we list our main results and make a few comments about them, comparing
them with the corresponding results in the $\mathcal{G}(n,p)$ and $\mathcal{G}_{n,r}$ models. In Subsection 2.1,
we will be interested in the maximum size of cliques and independent sets in
$\mathcal{G}(L_n,p)$, as well as the chromatic number of $\mathcal{G}(L_n,p)$ and its complement. In Subsection 2.2, we
will be interested in the expansion properties of random Latin square graphs as well as
several consequences of these properties regarding connectivity and Hamiltonicity. For
the results of this subsection it will be easier to work in the models $\mathcal{G}_m(L_n,k)$ and
$\mathcal{G}(L_n,k)$.

2.1. Cliques, independent sets and colouring. We begin with an upper bound on 
the clique number of random Latin square graphs. It is well known that the clique number 
of $\mathcal{G}(n, 1/2)$ is whp asymptotic to $2\log_2 n$. For the case of dense random regular graphs,
it was proved in [17] that the clique number of $\mathcal{G}_{n,n/2}$ is whp asymptotic to $2\log_2 n$.

Guided by the above results, one might hope to prove that the clique number of
$\mathcal{G}(L_n, 1/2)$ is whp $\Theta(\log n)$. However, it turns out that this is not the case. Green [12]
proved that the clique number of the random Cayley sum graph on $\mathbb{Z}_2^m$, with p = 1/2, is
whp $\Theta(\log n \log\log n)$, where $n = 2^m = |\mathbb{Z}_2^m|$. In the same paper, Green proved that the
clique number of the random Cayley sum graph on $\mathbb{Z}_n$, with p = 1/2, is whp $O(\log n)$.
This shows that, in general, results about the model $\mathcal{G}(L_n,p)$ can depend on the actual
sequence of Latin squares chosen.

To the best of our knowledge, the best known general result on the clique number is due
to Alon and Orlitsky [5], which says that the clique number of a random Cayley graph
arising from an abelian group of odd order n is whp $O((\log n)^2)$. Using similar methods,
we have managed to show that the same bound is in fact true for random Latin square
graphs. In particular, it is also true for random Cayley graphs arising from non-abelian
groups. We believe but cannot prove that the 2 in the exponent can be reduced further.

Theorem 1 (Clique number; upper bound). Let 0 < p < 1 be a fixed constant and let
$d = 1/(2p - p^2)$. Then, for almost every $G \in \mathcal{G}(L_n,p)$, we have

$$\omega(G) \le 27(\log_d n)^2.$$

Since the model $\overline{\mathcal{G}}(L_n,p)$ is different from $\mathcal{G}(L_n, 1-p)$, we cannot immediately deduce
a corresponding upper bound for the independence number. One way to find such a
bound is to couple the model $\overline{\mathcal{G}}(L_n,p)$ with $\mathcal{G}(L_n, 1-p)$, and use Theorem 1 to deduce
that for almost every $G \in \mathcal{G}(L_n,p)$,

$$\alpha(G) = \omega(\overline{G}) \le 27(\log_{1/(1-p^2)} n)^2.$$

In fact, using an argument similar to the one used in the proof of Theorem 1, we can
obtain a slightly better result.

Theorem 2 (Independence number; upper bound). Let 0 < p < 1 be a fixed constant
and let d = 1/(1 - p). Then, for almost every $G \in \mathcal{G}(L_n,p)$, we have

$$\alpha(G) \le 27(\log_d n)^2.$$

Recall that the (vertex) clique cover number $\theta(G)$ of a graph G is the smallest
integer k such that the vertex set of G can be partitioned into k cliques, i.e. $\theta(G) = \chi(\overline{G})$.
So an immediate corollary of Theorem 1 is:

Corollary 3 (Clique cover number; lower bound). Let 0 < p < 1 be a fixed constant and
let $d = 1/(2p - p^2)$. Then, for almost every $G \in \mathcal{G}(L_n,p)$, we have

$$\theta(G) \ge \frac{n}{27(\log_d n)^2}.$$ □

Similarly, Theorem 2 implies:

Corollary 4 (Chromatic number; lower bound). Let 0 < p < 1 be a fixed constant and
let d = 1/(1 - p). Then, for almost every $G \in \mathcal{G}(L_n,p)$, we have

$$\chi(G) \ge \frac{n}{27(\log_d n)^2}.$$ □

We now move to our upper bound on the chromatic number of random Latin square
graphs. Recall that for constant p, the chromatic number of $\mathcal{G}(n,p)$ is whp asymptotic
to $\frac{n}{2\log_b n}$, where b = 1/(1 - p). A similar behaviour was proved in [17] for the case of
random regular graphs of high degree. More specifically, it was proved that for any $\varepsilon > 0$,
if $\varepsilon n \le r \le 0.9n$, then the chromatic number of $\mathcal{G}_{n,r}$ is whp asymptotic to $\frac{n}{2\log_b n}$, where
b = n/(n - r).

For the case of random Latin square graphs, we prove an upper bound of the same
order of magnitude. However, since our lower bound is only of order $\frac{n}{(\log_b n)^2}$, we still do
not have a sharp asymptotic result for the chromatic number. In fact, as in the case of the
clique and independence numbers, we know that the chromatic number can depend on the
sequence of Latin squares chosen. For example, the result of Green [12] mentioned above,
that the independence number of the random Cayley sum graph on $\mathbb{Z}_n$ (with p = 1/2) is
whp $O(\log n)$, provides a lower bound for the chromatic number of these graphs which is
of the same order of magnitude as our corresponding upper bound. On the other hand,
we claim that the chromatic number of the random Cayley sum graph on $\mathbb{Z}_2^m$ is whp
$\Theta\left(\frac{n}{\log n \log\log n}\right)$, where $n = 2^m = |\mathbb{Z}_2^m|$. The lower bound follows immediately from the
result of Green [12] mentioned above for the independence number of these graphs. The
upper bound does not follow directly from that result; however, it follows from its proof
in [12] that in fact there is whp a $\lfloor \log m + \log\log m - 1 \rfloor$-dimensional subspace of $\mathbb{Z}_2^m$
which is an independent set. Indeed, given this result, it follows that whp, a random
Cayley sum graph on $\mathbb{Z}_2^m$ can be partitioned into $O\left(\frac{n}{\log n \log\log n}\right)$ independent sets of
this form.

In fact, our upper bound on the chromatic number will be an immediate consequence of
an upper bound on the list-chromatic number. Recall that the list-chromatic number
$\chi_l(G)$ of a graph G is the smallest positive integer k such that for any assignment of
k-element sets L(v) to the vertices of G, there is a proper vertex colouring c of G with
$c(v) \in L(v)$ for every vertex v of G.

Theorem 5 (List-chromatic number; upper bound). Let 0 < p < 1 be a fixed constant
and let d = 1/(1 - p). Then, for almost every $G \in \mathcal{G}(L_n,p)$, we have

$$\chi_l(G) \le \frac{n}{\frac14\log_d n - \frac12\log_d\log_d n - 2}.$$

Corollary 6 (Chromatic number; upper bound). Let 0 < p < 1 be a fixed constant and
let d = 1/(1 - p). Then, for almost every $G \in \mathcal{G}(L_n,p)$, we have

$$\chi(G) \le \frac{n}{\frac14\log_d n - \frac12\log_d\log_d n - 2}.$$

With similar methods we will show the following upper bound for the clique cover 
number. 

Theorem 7 (Clique cover number; upper bound). Let 0 < p < 1 be a fixed constant and
let d = 1/p. Then, for almost every $G \in \mathcal{G}(L_n,p)$, we have

$$\theta(G) \le \frac{n}{\frac12\log_d n - \log_d\log_d n - 6}.$$

From Corollary 6 and Theorem 7 we deduce corresponding lower bounds on the
independence and clique numbers.

Corollary 8 (Independence number; lower bound). Let 0 < p < 1 be a fixed constant
and let d = 1/(1 - p). Then, for almost every $G \in \mathcal{G}(L_n,p)$, we have

$$\alpha(G) \ge \frac12\log_d n - \log_d\log_d n - 6.$$ □

Corollary 9 (Clique number; lower bound). Let 0 < p < 1 be a fixed constant and let
d = 1/p. Then, for almost every $G \in \mathcal{G}(L_n,p)$, we have

$$\omega(G) \ge \frac14\log_d n - \frac12\log_d\log_d n - 2.$$ □

2.2. Expansion and related properties. Alon and Roichman [6] proved that random
Cayley graphs on a logarithmic number of generators are expanders whp. Our main result
of this subsection states that a similar result holds in the case of random Latin square
graphs.

Before stating our result we need to introduce some notation. Given a multigraph G,
its adjacency matrix is the matrix A = A(G) with rows and columns indexed by
the vertices of G, in which $A_{xy}$ is the number of edges in G joining x to y. If G is d-regular
then its normalised adjacency matrix T = T(G) is defined by $T = \frac{1}{d}A$. Note that
T is a real symmetric matrix, so it has an orthonormal basis of real eigenvectors. We
will write $\lambda_0 \ge \lambda_1 \ge \dots \ge \lambda_{n-1}$ for the eigenvalues of T. It is easy to check that $\lambda_0 = 1$
and that $\lambda_{n-1} \ge -1$. We will write $\mu$ for the second largest eigenvalue in absolute value,
i.e. $\mu = \max\{|\lambda_1|, |\lambda_{n-1}|\}$.
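
The second largest eigenvalue in absolute value can be estimated numerically. The Python sketch below uses plain power iteration on the orthogonal complement of the all-ones vector; this technique is an assumption of the example and not a method taken from the text. It is tried on the 5-cycle, viewed as the Cayley graph of Z_5 with respect to {1}, whose normalised eigenvalues are cos(2πj/5).

```python
import math

def normalised_matrix(A):
    """T = A / d for a d-regular multigraph given by its adjacency matrix A."""
    d = sum(A[0])
    assert all(sum(row) == d for row in A), "the multigraph must be regular"
    return [[a / d for a in row] for row in A]

def second_eigenvalue(T, iterations=500):
    """Estimate the largest absolute eigenvalue of T on the complement of the
    all-ones eigenvector by power iteration (a numerical sketch only)."""
    n = len(T)
    v = [float(i) for i in range(n)]
    mu = 0.0
    for _ in range(iterations):
        mean = sum(v) / n
        v = [x - mean for x in v]              # project away from the all-ones vector
        norm = math.sqrt(sum(x * x for x in v))
        if norm == 0.0:
            return 0.0
        v = [x / norm for x in v]
        w = [sum(T[i][j] * v[j] for j in range(n)) for i in range(n)]
        mu = math.sqrt(sum(x * x for x in w))  # |T v| for the current unit vector v
        v = w
    return mu

n = 5
A = [[1 if (i - j) % n in (1, n - 1) else 0 for j in range(n)] for i in range(n)]
mu = second_eigenvalue(normalised_matrix(A))
```

For graphs of realistic size a numerical linear algebra library would be the natural choice; the pure-Python loop is kept only to make the example self-contained.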

Finally, for 0 < x < 1, we define

$$H(x) = x\log(2x) + (1-x)\log(2(1-x)),$$

where we use the convention that all logarithms are natural.
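
A few numerical properties of H are easy to confirm. The Python sketch below checks that H(1/2) = 0, that H(x) approaches log 2 as x tends to 1, and that H((1 + ε)/2) is at least ε²/2 on a grid of values of ε. The last inequality is a standard estimate brought in as an assumption of this example, which follows from the series expansion of H((1 + ε)/2).

```python
import math

def H(x):
    """H(x) = x log(2x) + (1 - x) log(2(1 - x)) for 0 < x < 1, natural logarithms."""
    return x * math.log(2 * x) + (1 - x) * math.log(2 * (1 - x))

value_at_half = H(0.5)
value_near_one = H(1 - 1e-9)
quadratic_bound_holds = all(
    H((1 + eps) / 2) >= eps ** 2 / 2
    for eps in [i / 10 for i in range(1, 10)]
)
```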

We can now state our main theorem. 

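Theorem 10 (Second eigenvalue). Let L be an n x n Latin square with entries in [n]
and let $G \in \mathcal{G}_m(L, k)$. Then, for every $0 < \varepsilon < 1$,

$$\Pr(\mu(G) \ge \varepsilon) \le 2n\exp\left\{-kH\left(\frac{1+\varepsilon}{2}\right)\right\} \le 2n\exp\left\{-\frac{\varepsilon^2 k}{2}\right\}.$$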

We remark that if L is the difference table of a group, then the above theorem is similar 
to the result of Alon and Roichman mentioned in the beginning of this subsection. The 
only difference is that the bounds appearing in the above theorem, are the same as the 
bounds appearing in the authors' proof [8] of the Alon-Roichman theorem and are slightly 
better than the original bounds of the Alon-Roichman theorem. 

Recall that a graph G is an $(n,d,\varepsilon)$-expander if it is a graph on n vertices with
maximum degree d such that for every subset W of its vertices of size at most n/2 we
have $|N(W) \setminus W| \ge \varepsilon|W|$, where N(W) denotes the neighbourhood of W. Note that for
this definition we may ignore any multiple edges or loops that G may have. For more 
on expander graphs and their applications, we refer the reader to the recent survey of 
Hoory, Linial and Wigderson [13]. 

It is well known [26, 4] that a small second eigenvalue implies good expansion properties. 
The following corollary is an immediate consequence of Theorem 10 together with this 
fact. 

Corollary 11 (Expansion). For every $\delta > 0$, there is a $c(\delta) > 0$ depending only on $\delta$,
such that almost every $G \in \mathcal{G}_m(L_n, c(\delta)\log n)$ is an $(n, 2c(\delta)\log n, \delta)$-expander. □

The fact that the second eigenvalue of the graph is small implies that such a graph 
has several properties that many 'random-like' graphs possess. Informally, a graph of 
density p is pseudorandom if its edge distribution resembles the edge distribution of
$\mathcal{G}(n,p)$. The study of pseudorandom graphs was initiated by Thomason in [27, 28].
Chung, Graham and Wilson [10] showed that many properties that a graph may possess, 
including the property of having small second eigenvalue, are in some sense equivalent to 
pseudorandomness. 

Here we list just a few of these consequences, mostly taken from the recent survey of 
Krivelevich and Sudakov [16]. We omit some of the proofs, but we note that some care 
needs to be taken since our graphs are multigraphs, while the results in the survey are
stated only for simple graphs. 

To begin with, let us consider what value of k guarantees that almost every
$G \in \mathcal{G}(L_n,k)$ is connected. Let us first recall the corresponding results in $\mathcal{G}(n,p)$ and $\mathcal{G}_{n,r}$.
It is well known that for any fixed $\delta > 0$, if $p \le (1-\delta)\log n/n$, then $\mathcal{G}(n,p)$ is whp
disconnected, while if $p \ge (1+\delta)\log n/n$, then $\mathcal{G}(n,p)$ is whp connected. On the other
hand, $\mathcal{G}_{n,r}$ is whp connected provided that $r \ge 3$.

So what is the right threshold for the connectivity of random Latin square graphs? 
Once again this depends on the sequence $(L_n)$ of Latin squares chosen. For example, the
Cayley graph of $\mathbb{Z}_q$ for q prime, with respect to any set S containing a non-trivial element
is connected. On the other hand, the Cayley graph of $G = \mathbb{Z}_2^m$ with respect to any set of
size less than $m = \log_2 |G|$ is disconnected. Here, we prove that choosing slightly more
elements is enough to guarantee whp the connectedness not only of the random Cayley
graph of $\mathbb{Z}_2^m$ but in fact the connectedness of any random Latin square graph.
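
Both extreme cases can be verified exhaustively for small parameters. In the Python sketch below, the elements of $\mathbb{Z}_2^m$ are encoded as m-bit integers with bitwise XOR as the group operation; this encoding, the values m = 4 and q = 7, and the function names are assumptions of the example.

```python
from itertools import combinations

def is_connected(n, neighbours):
    """Depth-first search from vertex 0; neighbours(x) lists the vertices adjacent to x."""
    seen = {0}
    stack = [0]
    while stack:
        x = stack.pop()
        for y in neighbours(x):
            if y not in seen:
                seen.add(y)
                stack.append(y)
    return len(seen) == n

def cyclic_cayley(q, S):
    """Neighbour function of the Cayley graph of Z_q with respect to S."""
    return lambda x: [(x + s) % q for s in S] + [(x - s) % q for s in S]

def cube_cayley(S):
    """Neighbour function of the Cayley graph of Z_2^m with respect to S."""
    return lambda x: [x ^ s for s in S]

m = 4
# Every set of fewer than m elements spans a proper subgroup, so the graph is disconnected.
small_sets_disconnected = all(
    not is_connected(2 ** m, cube_cayley(S))
    for S in combinations(range(2 ** m), m - 1)
)
# A basis of Z_2^4 gives the hypercube, which is connected.
basis_connected = is_connected(2 ** m, cube_cayley([1, 2, 4, 8]))
# For prime q, any single non-trivial element already generates Z_q.
prime_connected = all(is_connected(7, cyclic_cayley(7, [s])) for s in range(1, 7))
```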

Theorem 12 (Connectedness). For any fixed $\delta > 0$, almost every $G \in \mathcal{G}(L_n, (1+\delta)\log_2 n)$
is connected.

Proof. It is enough to prove the result for $0 < \delta < 1/2$. Note that H(x) is continuous
in (0, 1) and tends to log 2 as x tends to 1. Pick an x such that $H(x) \ge (1 - \delta/2)\log 2$.
Then, for $k = |S| = (1+\delta)\log_2 n$, we have $kH(x) \ge (1+\delta)(1-\delta/2)\log n \ge (1+\delta/4)\log n$. Thus,

$$\Pr(\mu(G(L,S)) \ge 2x - 1) \le 2n\exp\{-(1+\delta/4)\log n\} = 2n^{-\delta/4} = o(1).$$

Thus whp, $\mu(G(L,S)) < 2x - 1 < 1$. It is well known that if $\mu(G) < 1$ then G is
connected, so the result follows. □
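
The arithmetic in this proof can be traced numerically. The Python sketch below picks x by bisection and evaluates the resulting failure bound; the bisection routine, the sample values of δ and n, and the function names are assumptions of the example, and the bound is taken in the form 2n exp(-kH(x)) used in the proof.

```python
import math

def H(x):
    """H(x) = x log(2x) + (1 - x) log(2(1 - x)), natural logarithms."""
    return x * math.log(2 * x) + (1 - x) * math.log(2 * (1 - x))

def choose_x(delta, steps=200):
    """Bisection for a point x in (1/2, 1) with H(x) >= (1 - delta/2) log 2.
    H is increasing on (1/2, 1), so the upper end of the bracket always qualifies."""
    target = (1 - delta / 2) * math.log(2)
    lo, hi = 0.5, 1 - 1e-12
    for _ in range(steps):
        mid = (lo + hi) / 2
        if H(mid) >= target:
            hi = mid
        else:
            lo = mid
    return hi

def failure_bound(n, delta):
    """Return x, k = (1 + delta) log_2 n and the bound 2n exp(-k H(x))."""
    x = choose_x(delta)
    k = (1 + delta) * math.log2(n)
    return x, k, 2 * n * math.exp(-k * H(x))

n = 10 ** 6
results = {delta: failure_bound(n, delta) for delta in (0.1, 0.25, 0.4)}
```

For each δ the computed bound is below $2n^{-\delta/4}$, in line with the displayed estimate; the decay is slow, which is why the statement is asymptotic.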

Let us now move to the vertex connectivity of random Latin square graphs. Recall
that the vertex connectivity $\kappa(G)$ of a graph G is the minimal number of vertices that
we need to remove in order to disconnect G. Clearly the vertex connectivity of any graph
is at most its minimum degree $\delta(G)$. It is well known that for $G \in \mathcal{G}(n,p)$ we have
$\kappa(G) = \delta(G)$ whp. Recently, it was shown in [17, 11] that the same holds for random r-regular
graphs provided $3 \le r \le n - 4$. In our case, the Cayley graphs on $\mathbb{Z}_2^m$ show that no
such result can hold if the generating set S has size less than $\log_2 n$. Can we expect that
such a result holds if the size of S is large enough? As the following example shows, the
answer is no. To understand the idea of the example, observe that if we can find two
neighbouring vertices x, y in a d-regular graph G with exactly d - 1 common neighbours,
then removing these neighbours disconnects x and y from the other vertices of G and
thus $\kappa(G) \le d - 1$. While in a random d-regular graph this is very unlikely to happen, in
the specific example that follows we can (deterministically) guarantee that the random
Latin square graph is regular and furthermore its vertex set can be partitioned into pairs
so that vertices in the same pair have exactly the same neighbours outside of this pair.
It thus only remains to check that whp at least two vertices which belong to the same
pair will be adjacent.



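Example. Define a Latin square L on $\{0, 1, \dots, r-1\} \times \{0, 1\}$ with entries in $\{0, 1, \dots, 2r-1\}$
as follows. Each of the four blocks of entries $L_{(x,0),(y,0)}$, $L_{(x,0),(y,1)}$, $L_{(x,1),(y,0)}$ and
$L_{(x,1),(y,1)}$ is given by one expression if $x \le y$ and by another if $x > y$ (the expressions themselves are not legible in this copy):

$$L_{(x,i),(y,j)} = \begin{cases} \dots & \text{if } x \le y \\ \dots & \text{if } x > y \end{cases} \qquad (i, j \in \{0, 1\}).$$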

Here, addition is done modulo 2r. It can be easily checked that L is indeed a Latin
square. Pick any $S \subseteq \{0, 1, \dots, 2r-1\}$ and let G = G(L, S). Note that G is d-regular for
some d. Note also that for any $x \in \{0, 1, \dots, r-1\}$, we have that
$N_G((x,0)) \setminus \{(x,1)\} = N_G((x,1)) \setminus \{(x,0)\}$, where $N_G$ denotes the neighbourhood of a vertex in G. But then, if
(x, 0) is adjacent to (x, 1) for some x, and G is not complete, we have that $\kappa(G) \le d - 1$.
Indeed, $N_G((x,0)) \setminus \{(x,1)\}$ is a disconnecting set of size d - 1. Now (x, 0) is adjacent to
(x, 1) if and only if $2x + r \in S$. Let $p = p(r) \in (0,1)$ be chosen such that $pr \to \infty$ and
$(1-p)r \to \infty$ as $r \to \infty$ and choose S by picking its elements independently at random
with probability p. Then whp G is not complete and there is an x such that (x, 0) is
adjacent to (x, 1) and so $\kappa(G) \le \delta(G) - 1$.

The above example shows that even if the size of S is large enough the vertex
connectivity of a random Latin square graph can be whp strictly smaller than its minimum
degree. However, our next theorem shows that if S is large enough then the vertex
connectivity of a random Latin square graph is whp at most one less than its minimum
degree.

Theorem 13 (Vertex connectivity). There is an absolute constant $C \le 168$ such that
whenever $C\log n \le k \le n/4$, then $\delta(G) - 1 \le \kappa(G) \le \delta(G)$ for almost every $G \in \mathcal{G}(L_n,k)$.

The example of $\mathbb{Z}_2^m$ shows that we cannot take C to be equal to 1. It would be
interesting to know whether every C strictly larger than 1 works or not. It seems that
our proof cannot bring the value of C down to $1 + \delta$ for any $\delta > 0$, so we have not tried
to optimize the value of C that our proof gives.

It should be noted that the above result is not a direct consequence of the expansion
properties of random Latin square graphs. From Theorem 10, we can only deduce that
$\mu = O(\sqrt{\log n/k})$. However, one can construct examples of d-regular graphs on n vertices,
with $d = O(\log n)$, $\mu = O(\sqrt{\log n/d})$ but $\kappa(G) \le d - \Omega(\log n)$. We refer the reader to
the discussion following [16, Theorem 4.1] for more details about how one can construct
such a graph.

Similar to the vertex connectivity, the edge connectivity $\lambda(G)$ of a graph G is the
minimal number of edges that we need to remove in order to disconnect G. It is easy
to show that $\kappa(G) \le \lambda(G) \le \delta(G)$. Hence, Theorem 13 applies with $\kappa(G)$ replaced by
$\lambda(G)$. In fact, our next theorem shows that we can do a bit more. If $|S| \ge (1+\delta)\log_2 n$
then whp the edge connectivity is equal to the minimum degree of G. In view of random
Cayley graphs on $\mathbb{Z}_2^m$, this is in fact best possible.

Theorem 14 (Edge connectivity). For any $\delta > 0$, if L is an n x n Latin square with
entries in [n] and S is a set of $(1+\delta)\log_2 n$ elements of [n], chosen independently and
uniformly at random, then whp, $\lambda(G(L,S)) = \delta(G(L,S))$.

Another graph property which follows from pseudorandomness is that of Hamiltonicity.
Again, this property depends on the structure of the Latin square. For example, the
Cayley graph of $\mathbb{Z}_q$ for q prime, with respect to any non-trivial element is Hamiltonian.
On the other hand, as was mentioned earlier, the Cayley graph of $G = \mathbb{Z}_2^m$ with respect
to any set of size less than $m = \log_2 |G|$ is not even connected. A very appealing conjecture
attributed to Lovász states that every connected Cayley graph is Hamiltonian. Together
with Theorem 12, this would imply for example that every random Cayley graph on
$(1 + o(1))\log_2 n$ generators is Hamiltonian. Pak (see [24]) conjectured that there is a
constant $c \ge 1$ such that every random Cayley graph on $(c + o(1))\log_2 n$ generators is
Hamiltonian. However, even this consequence is still not known. Recently, Krivelevich
and Sudakov [15] proved that every d-regular graph G on n vertices satisfying

$$\mu(G) \le \frac{(\log\log n)^2}{1000\log n\,(\log\log\log n)}$$

is Hamiltonian, provided n is large enough. Using this, together with the proof technique
of the Alon-Roichman theorem, they proved that a random Cayley graph on $O((\log n)^5)$
generators is whp Hamiltonian. Here, we extend this result to random Latin square
graphs as well. Moreover, using Theorem 10 directly, we can in fact replace $(\log n)^5$ by
$(\log n)^3$.

Theorem 15 (Hamiltonicity). If $k = \omega\left(\frac{(\log n)^3(\log\log\log n)^2}{(\log\log n)^4}\right)$, then almost every
$G \in \mathcal{G}(L_n,k)$ is Hamiltonian.$^1$

3. Cliques and independent sets 

We begin by finding upper bounds for the clique number of random Latin square graphs.
Naturally, one would like to find a good upper bound for the expected number of d-cliques
of a random Latin square graph, and from this deduce a corresponding upper bound for
the clique number. Given $A \subseteq [n]$ let $A' = \{L_{ij} : i, j \in A, i \ne j\}$. If $|A| = d$, then $|A'|$
can be as large as $d(d-1)$ and as small as $d - 1$. In the former case, the probability that A
forms a clique in $\mathcal{G}(L_n,p)$ is $(2p - p^2)^{\binom{d}{2}}$. However, in the latter case, this probability
is at least $p^{d-1}$. So, unless one is able to bound the number of $A \subseteq [n]$ for which $|A'|$
is relatively small, this approach cannot give any good bounds. Our approach will
be to show that any $A \subseteq [n]$ of size d has a subset B of size $\Omega(\sqrt{d})$, such that $|B'|$ is
relatively large, i.e. $\Omega(|B|^2)$. By standard arguments it will then follow that whp (if d
is large enough) no such B forms a clique, and hence no $A \subseteq [n]$ of size d forms a clique.
Before stating our main lemma, we need to introduce some more notation.
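
Both extremes for the size of A' already occur in the division table of a cyclic group. In the Python sketch below, the group Z_36 and the two sets of size d = 4 (a subgroup and a set with distinct pairwise differences) are assumptions chosen for illustration.

```python
def difference_set(n, A):
    """A' for the division table of Z_n: all values L[i][j] = i - j (mod n), i != j in A."""
    return {(i - j) % n for i in A for j in A if i != j}

n = 36
subgroup = [0, 9, 18, 27]   # a subgroup of Z_36: differences stay inside the subgroup
ruler = [0, 1, 3, 7]        # all twelve ordered differences are distinct modulo 36

small = difference_set(n, subgroup)
large = difference_set(n, ruler)
```

Here `small` has d - 1 = 3 elements and `large` has d(d - 1) = 12 elements.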

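For $A \subseteq [n]$, let

$$n_2(A) = |\{\{i,j\} : i,j \in A \text{ distinct and } L_{ij} = L_{ji}\}|;$$
$$n_3(A) = |\{(i,j,k) : i,j,k \in A \text{ distinct and } L_{ij} = L_{jk}\}|;$$
$$n_4(A) = |\{\{(i,j),(k,l)\} : i,j,k,l \in A \text{ distinct and } L_{ij} = L_{kl}\}|.$$

If $x \in A'$ appears exactly $r_x$ times as $L_{ij}$ for distinct $i,j \in A$, then, with the above
notation, we have

$$n_2(A) + n_3(A) + n_4(A) = \sum_{x \in A'} \binom{r_x}{2}.$$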
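
The counting identity can be checked by brute force on small examples. The Python sketch below does this for the division and addition tables of Z_9 and one five-element set A; the tables, the set and the function name are assumptions of the example.

```python
from collections import Counter
from itertools import combinations, permutations
from math import comb

def collision_counts(L, A):
    """Return n_2(A), n_3(A), n_4(A) and the multiplicities r_x of the symbols in A'."""
    A = sorted(A)
    n2 = sum(1 for i, j in combinations(A, 2) if L[i][j] == L[j][i])
    n3 = sum(1 for i, j, k in permutations(A, 3) if L[i][j] == L[j][k])
    # Each unordered pair {(i, j), (k, l)} is seen twice among ordered quadruples.
    n4 = sum(1 for i, j, k, l in permutations(A, 4) if L[i][j] == L[k][l]) // 2
    r = Counter(L[i][j] for i, j in permutations(A, 2))
    return n2, n3, n4, r

n = 9
A = [0, 1, 2, 4, 7]
division = [[(x - y) % n for y in range(n)] for x in range(n)]
addition = [[(x + y) % n for y in range(n)] for x in range(n)]

checks = []
for L in (division, addition):
    n2, n3, n4, r = collision_counts(L, A)
    checks.append((n2, n2 + n3 + n4, sum(comb(c, 2) for c in r.values())))
```

Each entry of `checks` holds n_2(A) followed by the two sides of the identity.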

We are now ready to state and prove our main lemma. 

Lemma 16. Let A be a subset of [n] of size a. Then for every $b \le a$, A contains
a subset B of size b such that

$$|B'| \ge b(b-1)\left(1 - \frac{b-2}{a-2} - \frac{(b-2)(b-3)}{2(a-3)}\right) - n_2(B).$$

Proof. For any $B \subseteq A$ of size b, we have

$$|B'| = b(b-1) - \sum_{x \in B'}(r_x - 1) \ge b(b-1) - \sum_{x \in B'}\binom{r_x}{2} = b(b-1) - n_2(B) - n_3(B) - n_4(B).$$

Picking B at random from all b element subsets of A, we have

$$E(n_3(B)) = n_3(A)\frac{b(b-1)(b-2)}{a(a-1)(a-2)}; \text{ and } E(n_4(B)) = n_4(A)\frac{b(b-1)(b-2)(b-3)}{a(a-1)(a-2)(a-3)}.$$

Fixing distinct $i,j \in A$, there is exactly one $k \in [n]$ such that $L_{ij} = L_{jk}$, hence $n_3(A) \le a(a-1)$.
Similarly, fixing distinct $i,j,k \in A$, there is exactly one $l \in [n]$ such that
$L_{ij} = L_{kl}$, hence $n_4(A) \le \frac12 a(a-1)(a-2)$. It follows that

$$E(|B'| + n_2(B)) \ge b(b-1)\left(1 - \frac{b-2}{a-2} - \frac{(b-2)(b-3)}{2(a-3)}\right)$$

and hence there is a choice of B satisfying the requirements of the lemma. □

$^1$ Recall that $f(n) = \omega(g(n))$ means that $f(n)/g(n) \to \infty$ as $n \to \infty$.

We can now prove Theorem 1. 

Proof of Theorem 1. Let $d = 1/(2p - p^2)$, let $b = 3\log_d n$ and let $a = 3b^2$. Pick any
$A \subseteq [n]$ of size a. By Lemma 16, there is a $B \subseteq A$ of size b, such that

$$|B'| \ge \frac56 b^2 - n_2(B) + O(b).$$

Pick $|B'|$ pairs (i, j) in B x B, with $i \ne j$, such that all $L_{ij}$ are distinct. Suppose that for
exactly k of the pairs we have $L_{ij} = L_{ji}$. It follows that there are at least

$$(|B'| - k) - \left(\binom{b}{2} - n_2(B)\right) \ge \frac13 b^2 - k + O(b)$$

sets {i, j}, such that both (i, j) and (j, i) have been chosen (and so $L_{ij} \ne L_{ji}$). Therefore,
the probability that B is a clique is at most

$$p^k(2p - p^2)^{\frac13 b^2 - k + O(b)} \le (2p - p^2)^{\frac13 b^2 + O(b)}.$$

So the expected number of cliques $B \subseteq [n]$ of size b with $|B'| \ge \frac56 b^2 - n_2(B) + O(b)$ is at
most

$$\binom{n}{b}(2p - p^2)^{\frac13 b^2 + O(b)} \le \frac{1}{b!}\left(n(2p - p^2)^{\frac13 b + O(1)}\right)^b = o(1).$$

Thus, by Markov's Inequality, we deduce that whp, no such B exists. By Lemma 16,
it now follows that whp, there is no clique of size $3b^2$, as required. □

In a similar way, we can prove the upper bound for the independence number. 

Proof of Theorem 2. Let $b = 3\log_{1/(1-p)} n$, and let $a = 3b^2$. Pick any $A \subseteq [n]$ of size a.
By Lemma 16, there is a $B \subseteq A$ of size b, such that

$$|B'| \ge \frac56 b^2 - n_2(B) + O(b) \ge \frac13 b^2 + O(b).$$

Therefore, the probability that B is an independent set is at most $(1-p)^{b^2/3 + O(b)}$, and
so the expected number of such independent sets is at most

$$\binom{n}{b}(1-p)^{b^2/3 + O(b)} \le \frac{1}{b!}\left(n(1-p)^{b/3 + O(1)}\right)^b = o(1).$$

Thus, by Markov's Inequality, we deduce that whp, no such B exists. By Lemma 16 it
now follows that whp, there is no independent set of size $3b^2$, as required. □

4. Colouring

We now move to the proof of the upper bounds on the chromatic number. Before 
presenting our proof, let us see why a standard approach from the theory of random 
graphs does not seem to generalise in a straightforward manner. 

Suppose we could show that whp, every induced subgraph of $G \in \mathcal{G}(L_n, 1/2)$ on
$n_1 = n/(\log n)^2$ vertices has an independent set of size at least $s_1 = (2 - \varepsilon)\log_2 n$. It then
follows immediately that whp, the chromatic number is at most $n/s_1 + n_1 \sim n/(2\log_2 n)$.
To do this, one usually shows that the probability that a given induced subgraph on
$n_1$ vertices does not contain an independent set of size $s_1$ is $O(\exp\{-n^{1+\delta}\})$ for some
$\delta > 0$. However in our model, this is far from being true. In fact, the probability that
$G \in \mathcal{G}(L_n, 1/2)$ is empty is $2^{-n}$, which is much larger than $O(\exp\{-n^{1+\delta}\})$. It turns out
that this problem can be rectified by using the expansion properties of the graph G. We
refer the reader to [3] to see how one can do this. Here, we will use a different approach
from which we can obtain a better constant in the bound.

Another approach for finding an upper bound for the chromatic number, is to analyse 
the greedy algorithm. This is the approach that we are going to use. This approach will 
in fact give an upper bound on the list-chromatic number as well. However, we need to 
modify the standard argument, because of the dependencies in the appearance of edges. 
In our modification we will make use of Talagrand's Inequality [25]. We will use the 
following version taken (essentially) from [14]. 

Talagrand's Inequality. Let X be a non-negative integer valued random variable, not
identically 0, which is determined by n independent random variables and let M be a
median of X. Suppose also that there exist K and r such that

(1) X is K-Lipschitz, i.e. changing the outcome of one of the variables changes the
value of X by at most K.

(2) For any s, if $X \ge s$, then there is a set of at most rs of the variables whose
outcome certifies that $X \ge s$.

Then

$$\Pr(|X - M| \ge t) \le \begin{cases} 4\exp\left\{-\frac{t^2}{8rK^2M}\right\} & \text{if } 0 \le t \le M; \\ 2\exp\left\{-\frac{t}{8rK^2}\right\} & \text{if } t > M. \end{cases}$$

In particular, it follows that

$$|EX - M| \le E|X - M| = \int_0^\infty \Pr(|X - M| > t)\,dt \le 2K\sqrt{8\pi rM} + 16rK^2.$$

Since also $M \le 2M\Pr(X \ge M) \le 2EX$, we deduce that for $0 \le t \le EX$

$$\Pr\left(|X - EX| \ge t + 16rK^2 + 16K\sqrt{rEX}\right) \le 4\exp\left\{-\frac{t^2}{16rK^2EX}\right\}.$$

This is the form of Talagrand's Inequality that we will be using.



Let us now proceed to the proof of Theorem 5. 

Proof of Theorem 5. Let $d = 1/(1-p)$ and let $u = \frac{1}{4}\log_d n - \frac{1}{2}\log_d\log_d n - 2$. Suppose
every vertex $v$ has a list $L(v)$ of size $\lfloor n/u \rfloor$. Fix an ordering $v_1, \ldots, v_n$ of the vertices.
Suppose we are given a (not necessarily proper) colouring $c$ of vertices $v_1, \ldots, v_m$, such
that $c(v_i) \in L(v_i)$ for each $1 \le i \le m$. Suppose $L(v_{m+1}) = \{x_1, \ldots, x_{\lfloor n/u \rfloor}\}$, let $c_i = c_i(m)$
be the number of times that colour $x_i$ is used on vertices $v_1, \ldots, v_m$ and let $A_{m+1}$ be the
event that $v_{m+1}$ has an earlier neighbour in every colour of the list $L(v_{m+1})$. We claim
that $\Pr(A_{m+1}) = o(1/n)$. Having proved this, we proceed by list-colouring the graph
greedily. The probability that this fails is at most $\sum_{m=1}^{n} \Pr(A_m) = o(1)$, so by Markov,
we have whp $\chi_l(G) \le n/u$.

To prove our claim, let $B_i = B_i(m)$ be the event that $v_{m+1}$ is joined with an earlier
vertex of colour $x_i$. Then clearly $\Pr(B_i) \le 1 - (1-p)^{2c_i}$. Let $Y$ be the number of colours
in $L(v_{m+1})$ appearing on earlier neighbours of $v_{m+1}$. Then
\[
\mathbb{E}Y \le \frac{n}{u} - \sum_{i} (1-p)^{2c_i} \le \frac{n}{u} - \frac{n}{u}(1-p)^{2mu/n} \le \frac{n}{u}\left(1 - (1-p)^{2u}\right),
\]

where the second inequality follows from the Arithmetic-Geometric Mean Inequality. Let
$X = Y - \mathbb{E}Y + \frac{n}{u}\left(1 - (1-p)^{2u}\right)$ and let $t = c\frac{n}{u}(1-p)^{2u}$, for some $0 < c < 1$ to be
determined later. Then $X$ satisfies the conditions of Talagrand's Inequality with $K = 2$
and $r = 1$. Note that, for $n$ large enough, $0 \le t \le \mathbb{E}X$, so
\[
\Pr\left(|X - \mathbb{E}X| \ge c\frac{n}{u}(1-p)^{2u} + 64 + 32\sqrt{\frac{n}{u}\left(1 - (1-p)^{2u}\right)}\right)
\le 4\exp\left\{-\frac{c^2 n(1-p)^{4u}}{64u}\right\}
\le 4\exp\left\{-\frac{c^2\log n}{16(1-p)^8\log(1/(1-p))}\right\}.
\]

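By elementary calculus, it is easy to show that $16x^8\log(1/x) \le 2/e$ whenever $0 < x < 1$.
Hence, choosing any $c$ with $\sqrt{2/e} < c < 1$, we deduce that
\[
\Pr\left(|X - \mathbb{E}X| \ge c\frac{n}{u}(1-p)^{2u} + 64 + 32\sqrt{\frac{n}{u}\left(1 - (1-p)^{2u}\right)}\right) = o(1/n).
\]
In particular, since $Y \le X$,
\[
\Pr\left(Y \ge \frac{n}{u} - c\frac{n}{u}(1-p)^{2u} + 64 + 32\sqrt{\frac{n}{u}}\right) = o(1/n).
\]
Since
\[
\frac{n}{u}(1-p)^{2u} = \frac{\sqrt{n}\log_d n}{u(1-p)^4},
\]
we deduce that (for $n$ large enough)
\[
\Pr(A_{m+1}) = \Pr(Y \ge \lfloor n/u \rfloor) = o(1/n).
\]
□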
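
The greedy list-colouring procedure analysed in the proof above can be illustrated by the following minimal sketch. It is not taken from the paper: the function names, the choice of the cyclic Latin square, the parameters $n = 60$, $p = 1/2$, and the decision to ignore loops are all assumptions made for illustration. Every list is taken to be the full palette, so the greedy step can never get stuck; the lists of size $\lfloor n/u \rfloor$ used in the proof are only meaningful for much larger $n$.

```python
import random

def cyclic_latin_square(n):
    """Addition table of Z_n (an illustrative choice of Latin square)."""
    return [[(i + j) % n for j in range(n)] for i in range(n)]

def latin_square_graph(L, S):
    """Adjacency sets of G(L, S): i ~ j iff L[i][j] in S or L[j][i] in S (loops ignored)."""
    n = len(L)
    adj = [set() for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if L[i][j] in S or L[j][i] in S:
                adj[i].add(j)
                adj[j].add(i)
    return adj

def greedy_list_colouring(adj, lists):
    """Colour vertices in the order 0, 1, ..., n-1 from their lists.

    Returns None if some vertex has an earlier neighbour in every colour
    of its list (the event called A_{m+1} in the proof)."""
    colour = {}
    for v in range(len(adj)):
        used = {colour[w] for w in adj[v] if w in colour}
        free = [x for x in lists[v] if x not in used]
        if not free:
            return None
        colour[v] = free[0]
    return colour

rng = random.Random(0)
n, p = 60, 0.5
L = cyclic_latin_square(n)
S = {a for a in range(n) if rng.random() < p}   # each element joins S with probability p
adj = latin_square_graph(L, S)
lists = [list(range(n)) for _ in range(n)]      # full palette: greedy cannot fail
colouring = greedy_list_colouring(adj, lists)
print("colours used:", len(set(colouring.values())))
```

Shrinking the lists in this sketch gives an empirical feel for how often the greedy step fails, which is the probability the proof bounds by $o(1/n)$ per vertex.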

Similarly, we can give an upper bound to the clique cover number. 

Proof of Theorem 7. Let $d = 1/p$ and let $u = \frac{1}{2}\log_d n - \log_d\log_d n - 6$. Fix an ordering of
the vertices. Suppose we are given a not necessarily proper colouring of the first $m$ vertices
of $\overline{G}$, using colours $1$ up to $\lfloor n/u \rfloor$. Let $c_i = c_i(m)$ be the number of times colour $i$ is used
and let $A_{m+1}$ be the event that the $(m+1)$-th vertex has a neighbour in every colour.
We claim that $\Pr(A_{m+1}) = o(1/n)$. Having proved this, we colour the graph greedily.
The probability that we need more than $\lfloor n/u \rfloor$ colours is at most $\sum_{m=1}^{n} \Pr(A_m) = o(1)$,
so by Markov, we have whp $\theta(G) = \chi(\overline{G}) \le n/u$.

To prove our claim, let $B_i = B_i(m)$ be the event that the $(m+1)$-th vertex is joined
(in $\overline{G}$) with an earlier vertex of colour $i$. Then clearly $\Pr(B_i) \le 1 - p^{c_i}$. Let $Y$ be the
number of colours appearing on earlier neighbours of the $(m+1)$-th vertex. Then
\[
\mathbb{E}Y \le \frac{n}{u} - \sum_{i} p^{c_i} \le \frac{n}{u} - \frac{n}{u}p^{mu/n} \le \frac{n}{u}\left(1 - p^{u}\right).
\]

Let $X = Y - \mathbb{E}Y + \frac{n}{u}(1 - p^u)$ and let $t = c\frac{n}{u}p^u$, for some $0 < c < 1$ to be determined
later. Then $X$ satisfies the conditions of Talagrand's Inequality with $K = 2$ and $r = 1$.
Note that, for $n$ large enough, $0 \le t \le \mathbb{E}X$, so
\[
\Pr\left(|X - \mathbb{E}X| \ge c\frac{n}{u}p^u + 64 + 32\sqrt{\frac{n}{u}(1 - p^u)}\right)
\le 4\exp\left\{-\frac{c^2 n p^{2u}}{64u}\right\}
\le 4\exp\left\{-\frac{c^2\log n}{32p^{12}\log(1/p)}\right\}.
\]

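But $32x^{12}\log(1/x) \le 8/(3e) < 1$ whenever $0 < x < 1$. Hence, choosing any $c$ with
$\sqrt{8/(3e)} < c < 1$, we deduce that
\[
\Pr\left(|X - \mathbb{E}X| \ge c\frac{n}{u}p^u + 64 + 32\sqrt{\frac{n}{u}(1 - p^u)}\right) = o(1/n).
\]
In particular, since $Y \le X$,
\[
\Pr\left(Y \ge \frac{n}{u} - c\frac{n}{u}p^{u} + 64 + 32\sqrt{\frac{n}{u}}\right) = o(1/n).
\]
Since
\[
\frac{n}{u}p^{u} = \frac{\sqrt{n}\log_d n}{up^{6}},
\]
we deduce that (for $n$ large enough)
\[
\Pr(A_{m+1}) = \Pr(Y \ge \lfloor n/u \rfloor) = o(1/n).
\]
□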
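
The two elementary calculus facts used in the proofs of Theorems 5 and 7 can be checked numerically. The sketch below is an illustration only (the helper name `f` and the grid resolution are assumptions). It relies on the fact that $x^b\log(1/x)$ is maximised on $(0,1)$ at $x = e^{-1/b}$, where it equals $1/(be)$.

```python
import math

def f(x, a, b):
    """The function a * x**b * log(1/x) on 0 < x < 1."""
    return a * x ** b * math.log(1.0 / x)

# Grid search over (0, 1); the grid maximum can only underestimate the true maximum.
grid = [i / 10000 for i in range(1, 10000)]
max_thm5 = max(f(x, 16, 8) for x in grid)    # 16 x^8 log(1/x)
max_thm7 = max(f(x, 32, 12) for x in grid)   # 32 x^12 log(1/x)

bound_thm5 = 2 / math.e          # claimed bound in the proof of Theorem 5
bound_thm7 = 8 / (3 * math.e)    # claimed bound in the proof of Theorem 7

# Values at the exact maximisers x = exp(-1/8) and x = exp(-1/12).
peak_thm5 = f(math.exp(-1 / 8), 16, 8)
peak_thm7 = f(math.exp(-1 / 12), 32, 12)

# Smallest admissible values of the constant c in each proof.
c_min_thm5 = math.sqrt(bound_thm5)
c_min_thm7 = math.sqrt(bound_thm7)
print(max_thm5, bound_thm5, c_min_thm5)
print(max_thm7, bound_thm7, c_min_thm7)
```

Both thresholds for $c$ lie strictly below 1, which is what leaves room to choose $c$ in each proof.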

5. Expansion and consequences 

We now proceed to the expansion properties of random Latin square graphs and to 
the proof of Theorem 10 on the second eigenvalue of such graphs. In [8], we generalized 
Hoeffding's inequality, to an inequality where the random variables do not necessarily 
take real values, but instead take their values in the set of (self-adjoint) operators of a 
(finite dimensional) Hilbert space. We then used this inequality to give a new proof of 
the Alon-Roichman theorem. The main tool in the proof of Theorem 10 will be this 
Operator Hoeffding Inequality. Before stating the inequality, we need to introduce some 
more notation. 

Let $V$ be a Hilbert space of dimension $d$, let $A(V)$ be the set of self-adjoint operators
on $V$ and let $P(V)$ be the cone of positive operators on $V$, i.e.

$P(V) = \{A \in A(V) : \text{all eigenvalues of } A \text{ are nonnegative}\}$.

This defines a partial order on $A(V)$ by $A \le B$ iff $B - A \in P(V)$. We denote by $[A, B]$
the set of all $C \in A(V)$ such that $A \le C \le B$. We also denote by $\|A\|$ the largest
eigenvalue of $A$ in absolute value.

We can now state our Operator Hoeffding Inequality. We refer the reader to [8] for its
proof.

Theorem 17 ([8] Operator Hoeffding Inequality). Let $V$ be a Hilbert space of dimension
$d$ and let $X_i = \mathbb{E}(X \mid \mathcal{F}_i)$ be a martingale, taking values in $A(V)$, whose difference sequence
satisfies $Y_i \in [-\frac{1}{2}I, \frac{1}{2}I]$. Then
\[
\Pr(\|X - \mathbb{E}X\| \ge nh) \le 2d\exp\{-nH(1/2 + h)\}.
\]
Note that the case $d = 1$ of this inequality is exactly Hoeffding's inequality.

We now proceed to show that random Latin square graphs have small second eigenvalue 
and thus good expansion properties. 

Proof of Theorem 10. Let $s_1, \ldots, s_k$ be elements of $[n]$ chosen independently and uniformly
at random. For $s \in [n]$ let $L(s)$ be the 0,1 matrix in which $L(s)_{ij} = 1$ if and only
if $L_{ij} = s$. So the normalised adjacency matrix of the multigraph $G$ generated by these
elements is
\[
T = \frac{1}{2k}\sum_{i=1}^{k}\left(L(s_i) + L(s_i)^T\right).
\]
Let $B = T - \frac{1}{n}J$, where $J$ is the $n$ by $n$ matrix having '1' in every entry. We claim
that $\mu(G) = \|B\|$. Indeed, if $\{v_0, v_1, \ldots, v_{n-1}\}$ is an orthonormal basis of eigenvectors
of $T$, with each $v_i$ having eigenvalue $\lambda_i$, and $v_0 = \frac{1}{\sqrt{n}}(1, \ldots, 1)$, then
$Bv_0 = 0$ and $Bv_i = \lambda_i v_i$ for $i \ge 1$, so $\mu(G) = \|B\|$ as required. Let $Y_i$ be the operator whose matrix
is $\frac{1}{4}\left[L(s_i) + L(s_i)^T - \frac{2}{n}J\right]$. It is easy to check that $X_i = Y_1 + \ldots + Y_i$ is a martingale
satisfying the conditions of the theorem. It follows that



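\[
\Pr(\mu(G) \ge \varepsilon) = \Pr\left(\|X_k\| \ge \frac{k\varepsilon}{2}\right) \le 2n\exp\{-kH(1/2 + \varepsilon/2)\},
\]
as required.
□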
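
The objects in the proof above are easy to compute for a small example. The following sketch is an illustration under stated assumptions, not the authors' code: it takes $L$ to be the difference table of $\mathbb{Z}_n$ (so the graph is a random Cayley graph), picks $n = 31$ and $k = 12$ arbitrarily, and estimates $\|B\|$ by plain power iteration, which for a symmetric matrix gives a lower estimate of the norm. All function names are hypothetical.

```python
import math
import random

def difference_table(n):
    """Latin square with entries (i - j) mod n, the 'division table' of Z_n."""
    return [[(i - j) % n for j in range(n)] for i in range(n)]

def normalised_adjacency(L, elems):
    """T = (1/2k) * sum over chosen s of (L(s) + L(s)^T), with L(s)[i][j] = 1 iff L[i][j] == s."""
    n, k = len(L), len(elems)
    T = [[0.0] * n for _ in range(n)]
    w = 1.0 / (2 * k)
    for s in elems:
        for i in range(n):
            for j in range(n):
                if L[i][j] == s:
                    T[i][j] += w   # contribution of L(s)
                    T[j][i] += w   # contribution of L(s)^T
    return T

def matvec(M, v):
    return [sum(a * b for a, b in zip(row, v)) for row in M]

def norm_estimate(B, iters=300, seed=1):
    """Power iteration v -> Bv/|Bv|; |Bv| for unit v never exceeds ||B||."""
    rng = random.Random(seed)
    v = [rng.uniform(-1.0, 1.0) for _ in B]
    length = math.sqrt(sum(x * x for x in v))
    v = [x / length for x in v]
    est = 0.0
    for _ in range(iters):
        w = matvec(B, v)
        est = math.sqrt(sum(x * x for x in w))
        if est == 0.0:
            break
        v = [x / est for x in w]
    return est

rng = random.Random(0)
n, k = 31, 12
L = difference_table(n)
elems = [rng.randrange(n) for _ in range(k)]   # s_1, ..., s_k, independent and uniform
T = normalised_adjacency(L, elems)
B = [[T[i][j] - 1.0 / n for j in range(n)] for i in range(n)]   # B = T - J/n
mu = norm_estimate(B)
print("estimate of mu(G):", mu, " 2*sqrt(log n / k):", 2 * math.sqrt(math.log(n) / k))
```

Since $T$ is symmetric and doubly stochastic, the all-ones vector lies in the kernel of $B$ and the estimate cannot exceed 1; the printed comparison with $2\sqrt{\log n / k}$ is only indicative at this small size.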



Proof of Theorem 13. Let $T$ be a minimal disconnecting set, so $|T| \le \delta(G) \le 2k$. Let $U$
be the smallest component of $G \setminus T$ and let $W = V(G) \setminus (U \cup T)$. Recalling that $k \le n/4$,
we see that $|W| \ge n/4$. We claim that whp, $|U| \le 128\log n$. So let us assume that
$|U| > 128\log n$ and aim to obtain a contradiction.

Our proof will be very similar to [16, Theorem 4.1]. In particular, we will use the edge
distribution bound for pseudorandom graphs (see e.g. [16, Theorem 2.11]) which implies
that for all subsets $A, B$ of the vertex set of $G$ we have
\[
\left| e(A, B) - \frac{2k}{n}|A||B| \right| \le 2k\mu(G)\sqrt{|A||B|}. \qquad (1)
\]
Here $e(A, B)$ denotes the number of edges between $A$ and $B$ counted with multiplicity.
Note in particular that this means that edges in $G[A \cap B]$ are counted twice.

Firstly, we deduce from Theorem 10 that whp
\[
\mu(G) \le 2\sqrt{\frac{\log n}{k}}.
\]
Since $e(U, W) = 0$, it follows from (1) that
\[
|U||W| \le n\mu(G)\sqrt{|U||W|}
\]
and so
\[
|U| \le \frac{\mu(G)^2 n^2}{|W|} \le 4\mu(G)^2 n \le \frac{16n\log n}{k}.
\]

Using this together with (1) we get that
\[
e(U, U) \le \frac{2k}{n}|U|^2 + 2k\mu(G)|U| \le \left(32\log n + 4\sqrt{k\log n}\right)|U| \le \frac{k}{2}|U|,
\]
where in the last inequality we used the assumption that $k \ge C\log n$, for some large
enough $C$. It follows that
\[
e(U, T) = 2k|U| - e(U, U) \ge \frac{3k}{2}|U|.
\]
On the other hand, using (1) once more, we have
\[
e(U, T) \le \frac{2k}{n}|U||T| + 2k\mu(G)\sqrt{|U||T|} \le \left(\frac{4k}{n} + 4\sqrt{\frac{2\log n}{|U|}}\right)k|U| < \frac{3k}{2}|U|,
\]
where we have used the facts that $|T| \le 2k$, $k \le n/4$ and the assumption that $|U| >
128\log n$.

But this is a contradiction as we have proved that $3k|U|/2 \le e(U, T) < 3k|U|/2$. So
we may assume that $|U| \le 128\log n$.

We now claim that whp, the following holds: For any 3 distinct vertices $x, y, z$ of $G$,
$|(N(x) \cup N(y)) \setminus N(z)| > 128\log n$, where $N(x)$ denotes the neighbourhood of the vertex
$x$. Having proved this, it will follow that whp $|U| \le 2$ and so $|T| \ge \delta(G) - 1$. To see how
this follows, observe that if $U$ contains three vertices, say $x, y, z$, then the total number
of their neighbours is whp bigger than $128\log n + |N(z)|$. In particular, there must be
more than $128\log n + |N(z)| - |U| \ge |N(z)|$ vertices outside $U$ which have one of $x, y, z$ as
their neighbour. But this implies that $|T| > |N(z)| \ge \delta(G)$, contradicting the minimality
of $|T|$.

So, let $x, y, z$ be distinct vertices of $G$. Let $s_1, s_2, \ldots, s_k$ be the elements of $S$ chosen uniformly
at random and let $X_i = \mathbb{E}(|(N(x) \cup N(y)) \setminus N(z)| \mid s_1, \ldots, s_i)$. Then $X_0, X_1, \ldots, X_k$
is a martingale with Lipschitz constant 4 and $X_0$ is $k(1 - 2/n)^k$. It follows by the Hoeffding-Azuma
inequality that

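\[
\Pr\left(|(N(x) \cup N(y)) \setminus N(z)| \le k(1 - 2/n)^k - t\right) \le \exp\left\{-\frac{t^2}{32k}\right\}.
\]
Now let $t = \sqrt{128k\log n}$ and observe that if $C$ and $n$ are large enough then $128\log n \le
k(1 - 2/n)^k - t$ and so
\[
\Pr\left(|(N(x) \cup N(y)) \setminus N(z)| \le 128\log n\right) = O(n^{-4}).
\]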

Our claim now follows from the union bound. This completes the proof of the theorem. 
(It can be checked that C = 168 works.) □ 
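
Some of the numerical constants in the proof above can be verified directly. The sketch below is an illustration only and checks just two steps: the inequality $32\log n + 4\sqrt{k\log n} \le k/2$ for $k \ge C\log n$ (after dividing by $\log n$ and writing $k = c'\log n$, this becomes $32 + 4\sqrt{c'} \le c'/2$), and the coefficient $4\sqrt{2\log n/|U|} \le 1/2$ when $|U| \ge 128\log n$. It does not check the remaining steps of the proof; the variable names are hypothetical.

```python
import math

C = 168

def lhs(c):
    """Left-hand side 32 + 4*sqrt(c) of the normalised inequality."""
    return 32 + 4 * math.sqrt(c)

# The inequality 32 + 4*sqrt(c) <= c/2 holds exactly when sqrt(c) >= 4 + sqrt(80),
# since c/2 - 4*sqrt(c) - 32 is a quadratic in sqrt(c) with positive root 4 + sqrt(80).
threshold = (4 + math.sqrt(80)) ** 2
holds_at_C = lhs(C) <= C / 2
fails_at_167 = lhs(167) > 167 / 2

# Coefficient 4*sqrt(2 log n / |U|) evaluated at |U| = 128 log n.
coefficient = 4 * math.sqrt(2 / 128)

print("threshold for C:", threshold)
print("C = 168 satisfies the inequality:", holds_at_C)
print("coefficient at |U| = 128 log n:", coefficient)
```

The threshold lies between 167 and 168, so 168 is the smallest integer for which this particular step goes through.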






DEMETRES CHRISTOFIDES AND KLAS MARKSTROM 



We omit the proof of Theorem 14, as it can be proved using an argument similar to that
in [16, Theorem 4.3].

Sketch proof of Theorem 15. Firstly, one needs to check that the result of Krivelevich
and Sudakov [15] mentioned before the statement of the theorem also holds for $d$-regular
multigraphs. We omit the details of this check. Then the result follows directly from
Theorem 10. □

6. Conclusion and open problems 

We have introduced new models of random graphs arising from Latin squares and 
studied some of their properties. There is still a lot of research that needs to be done 
even for many of the properties that we have considered here. 

Regarding the clique and independence numbers it would be interesting to know if 
the upper bound can be reduced further. In particular, we believe (but cannot prove) 
that the 2 in the exponent can be reduced further. It would also be interesting to know
whether there are examples of random Latin square graphs whose clique/independence
number is significantly larger than $\log n \log\log n$. It looks plausible that this is not
the case. 

Similar remarks hold for the lower bound on the chromatic and clique cover numbers. 
Any improvement on the upper bound of the independence/clique numbers would give 
a corresponding improvement on the chromatic/clique cover numbers but it might be 
possible (or even easier) to get such improvements directly. 

Another interesting question which we have not been able to answer so far is the 
determination of the Hadwiger number of random Latin square graphs, i.e. the largest 
integer $k$ such that the graph can be contracted into a $K_k$. We do not even know, for
p = 1/2 say, whether this number depends on the sequence of Latin squares or not. 

We have not studied at all the girth of random Latin square graphs. The reason is that 
it depends a lot on the structure of the Latin squares chosen. For example, almost every 
$G \in \mathcal{G}(\mathbb{Z}_3^m, p)$ has whp girth 3, provided $pn \to \infty$, where $n = 3^m$. On the other hand,
we claim that almost every $G \in \mathcal{G}(\mathbb{Z}_2^m, p)$ has whp girth strictly greater than 3 provided
that $pn^{2/3} \to 0$, where $n = 2^m$. Indeed, the expected number of triangles containing a
fixed vertex $x$ is $\binom{n-1}{2}p^3$ which tends to 0. By Markov's inequality $x$ is whp not contained
in any triangle. But since the graph is vertex transitive, our claim follows. 
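
The two girth examples can be made concrete with a short computation. The sketch below is an illustration under stated assumptions: elements of $\mathbb{Z}_2^m$ are encoded as integers with XOR as the group operation, elements of $\mathbb{Z}_3^m$ as tuples, and the function names are hypothetical. It counts the triangles through the vertex 0 when $S$ consists of all non-zero elements of $\mathbb{Z}_2^m$, which gives the number of potential triangles, each present with probability $p^3$, and it checks that in $\mathbb{Z}_3^m$ a single element $a \in S$ already creates the triangle $\{0, a, 2a\}$.

```python
from itertools import combinations
from math import comb

def triangles_through_zero(S):
    """Triangles {0, a, b} in the Cayley graph of Z_2^m on S:
    0 ~ a, 0 ~ b and a ~ b hold iff a, b and a XOR b all lie in S."""
    return sum(1 for a, b in combinations(sorted(S), 2) if (a ^ b) in S)

def z3_triangle(a):
    """In Z_3^m, check that the differences a - 0, 2a - a and 0 - 2a all equal a,
    so that 0, a, 2a are pairwise joined as soon as a is in S."""
    zero = tuple(0 for _ in a)
    two_a = tuple((2 * x) % 3 for x in a)

    def diff(x, y):
        return tuple((s - t) % 3 for s, t in zip(x, y))

    return diff(a, zero) == a and diff(two_a, a) == a and diff(zero, two_a) == a

m = 4
n = 2 ** m
all_nonzero = set(range(1, n))
potential_triangles = triangles_through_zero(all_nonzero)
print(potential_triangles, comb(n - 1, 2))
print(z3_triangle((1, 2, 0)))
```

Each potential triangle $\{0, a, b\}$ needs the three distinct elements $a$, $b$ and $a + b$ to be in $S$, which is why the expectation carries the factor $p^3$.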

The expansion properties of random Latin squares imply that almost every $G \in
\mathcal{G}(L_n, c\log_2 n)$, with $c > 1$, has logarithmic diameter. An interesting question here is
the threshold for the diameter becoming equal to 2. It turns out that there are constants
$c_1$ and $c_2$ such that if $p < c_1\sqrt{\log n / n}$, then almost every $G \in \mathcal{G}(L_n, p)$ has diameter
greater than 2, while if $p > c_2\sqrt{\log n / n}$, then almost every $G \in \mathcal{G}(L_n, p)$ has diameter
less than or equal to 2. The values of $c_1$ and $c_2$ depend on the sequence of Latin squares
chosen. Our results regarding the diameter will appear in a forthcoming paper [9]. 